Single channel speech separation using maximum a posteriori estimation
نویسندگان
چکیده
We present a new approach for separating two speech signals when only a single recording of their additive mixture is available. In this approach, log spectra of the sources are estimated using maximum a posteriori estimation given the mixture’s log spectrum and the probability density functions of the sources. It is shown that the estimation leads to a two-state, non-linear filter whose states are controlled by the means of the sources. The first state of the filter is expressed using a combination of two Wiener filters whose parameters are controlled by the means and variances of the sources and noise variance and the second state is expressed by the means of the sources. Through the experiments, conducted on a wide variety of mixtures, we show that the MAP based estimator outperforms the methods which use binary mask filtering or Wiener filtering for the separation task.
منابع مشابه
Noise Reduction by Maximum a Posteriori Spectral Amplitude Estimation with Supergaussian Speech Modeling
ESTIMATION WITH SUPERGAUSSIAN SPEECH MODELING Thomas Lotter and Peter Vary Institute of Communication Systems and Data Processing ( ) Aachen University (RWTH), Templergraben 55, D-52056 Aachen, Germany E-mail: lotter vary @ind.rwth-aachen.de ABSTRACT This contribution presents a spectral amplitude estimator for acoustical background noise suppression based on maximum a posteriori estimation and...
متن کاملSingle Channel Audio Source Separation
-Blind source separation is an advanced statistical tool that has found widespread use in many signal processing applications. However, the crux topic based on one channel audio source separation has not fully developed to enable its way to laboratory implementation. The main idea approach to single channel blind source separation is based on exploiting the inherent time structure of sources kn...
متن کاملA Generalized Approach for Model-based Speaker-dependent Single Channel Speech Separation
Abstract– In this paper, we present a new technique for separating two speech signals received from one microphone or one communication channel. In this special case, the separation problem is too ill-conditioned to be handled with common blind source separation techniques. The proposed technique is a generalized approach to model-based speaker-dependent single channel speech separation techniq...
متن کاملDynamic channel compensation based on maximum a posteriori estimation
The degradation of speech recognition performance in real-life environments and through transmission channels is a main embarrassment for many speech-based applications around the world, especially when non-stationary noise and changing channel exist. In this paper, we extend our previous works on Maximum-Likelihood (ML) dynamic channel compensation by introducing a phone-conditioned prior stat...
متن کاملTitle of Document : MAXIMUM LIKELIHOOD PITCH ESTIMATION USING SINUSOIDAL MODELING
Title of Document: MAXIMUM LIKELIHOOD PITCH ESTIMATION USING SINUSOIDAL MODELING Vijay Mahadevan, Master of Science, 2010 Directed By: Dr. Carol Y. Espy-Wilson Department of Electrical and Computer Engineering The aim of the work presented in this thesis is to automatically extract the fundamental frequency of a periodic signal from noisy observations, a task commonly referred to as pitch estim...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007